Parameterization and automatic labeling of Hungarian intonation
نویسندگان
چکیده
In Hungarian intonation research the goal of a common framework developed by Varga (2002; [1]) is to categorize the intonation within the domain of accent groups by character contours. We propose a linear parameterization of a subset of these contours derived from polynomial stylization. These parameters were used to train classification trees and support vector machines for contour prediction. Parameter extraction and training was carried out on the original F0 contours of spontaneous speech data as well as on three differently normalized variants suppressing fundamental frequency level and range effects. The highest accuracies were obtained for classification trees and F0 residuals after midline subtraction, but the overall performances were rather poor. Nevertheless, a significant improvement of the results was achieved by a Hidden Markov model to predict the correct label sequence from the partly erroneous classification output.
منابع مشابه
Parameterization of prosodic headedness
Prosodic headedness generally refers to the location of relevant prosodic events at the left or right end of prosodic constituents. In a bottom-up procedure based on a computational F0 stylization we tested several measures to quantify headedness in parametrical and categorical terms for intonation in the accentual phrase (AP) domain. These measures refer to F0 level and range trends as well as...
متن کاملAutomatic Labeling of Intonation Using Acoustic and Lexical Features
This paper proposes a framework of automatic intonation labeling which involves detection and classification of pitch accents and phrase boundaries. Four statistical models are designed to perform these tasks on the basis of a compact and simple representation consisting of features identified as the main acoustic correlates of accentual prominence and phrase boundaries or describing the acoust...
متن کاملA 75-year-old Hungarian spontaneous speech database
The first attempt to develop a large collection of recorded speech material in Hungarian was made by the phonetician Lajos Hegedűs in the 1940s. He wanted to preserve the sounding of the various Hungarian dialects in the country and even outside Hungary with the purpose of analyzing, among other things, the intonation, pauses and rhythm of speech in those dialects. This paper introduces the pro...
متن کاملProsody Annotation for Unit Selection Tts Synthesis
This paper concerns prosody annotation and intonation modeling, especially for the application in a corpus based speech synthesis. In order to establish the rules of the automatic intonation modeling, a four hour fully annotated speech database has been acoustically and perceptually analyzed. The speech material included different text types, dialogs and prosodically rich phrases. As the result...
متن کاملProsody annotation for corpus based speech synthesis
The paper concerns prosody annotation especially for application in a corpus based speech synthesis. In order to establish the rules of automatic intonation modelling, phonetically labeled speech database of 4 hours has been perceptually and acoustically analyzed. The speech material included different text types and prosodically rich phrases. The annotation of the speech database consists in p...
متن کامل